AMFuse: Add–Multiply-Based Cross-Modal Fusion Network for Multi-Spectral Semantic Segmentation

نویسندگان

چکیده

Multi-spectral semantic segmentation has shown great advantages under poor illumination conditions, especially for remote scene understanding of autonomous vehicles, since the thermal image can provide complementary information RGB image. However, methods to fuse from and are still under-explored. In this paper, we propose a simple but effective module, add–multiply fusion (AMFuse) fusion, consisting two math operations—addition multiplication. The addition operation focuses on extracting cross-modal features, while multiplication concentrates common features. Moreover, attention module atrous spatial pyramid pooling (ASPP) modules also incorporated into our proposed AMFuse modules, enhance multi-scale context information. Finally, in UNet-style encoder–decoder framework, ResNet model is adopted as encoder. As decoder part, obtained hierarchically merged layer-by-layer restore feature map resolution segmentation. experiments RGBT multi-spectral salient object detection demonstrate effectiveness fusing

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Medical Image Segmentation Based on Multi-Modal Convolutional Neural Network: Study on Image Fusion Schemes

Image analysis using more than one modality (i.e. multi-modal) has been increasingly applied in the field of biomedical imaging. One of the challenges in performing the multimodal analysis is that there exist multiple schemes for fusing the information from different modalities, where such schemes are application-dependent and lack a unified framework to guide their designs. In this work we fir...

متن کامل

Probabilistic Information Fusion for Multi-Modal Image Segmentation

Observable evidence from disparate sources are combined coherently and consistently through a hierarchically structured knowledge tree. Prior knowledge of spatial interactions is modeled with Markov Random Fields. A posteriori probabilities of segmentations are maintained incrementally. This paper is a shortened version of [Chou and Brown 87], which contains many more references.* 1. A Probabil...

متن کامل

Effective Fusion of Multi-Modal Remote Sensing Data in a Fully Convolutional Network for Semantic Labeling

In recent years, Fully Convolutional Networks (FCN) have led to a great improvement of semantic labeling for various applications including multi-modal remote sensing data. Although different fusion strategies have been reported for multi-modal data, there is no in-depth study of the reasons of performance limits. For example, it is unclear, why an early fusion of multi-modal data in FCN does n...

متن کامل

Multi-modal Semantic Place Classification Multi-modal Semantic Place Classification

The ability to represent knowledge about space and its position therein is crucial for a mobile robot. To this end, topological and semantic descriptions are gaining popularity for augmenting purely metric space representations. In this paper we present a multi-modal place classification system that allows a mobile robot to identify places and recognize semantic categories in an indoor environm...

متن کامل

Flexible Multi-modal Graph-Based Segmentation

This paper aims at improving the well-known local variance segmentation method by adding extra signal modi and specific processing steps. As a key contribution, we extend the uni-modal segmentation method to perform multi-modal analysis, such that any number of signal modi available can be incorporated in a very flexible way. We have found that the use of a combined weight of luminance and dept...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Remote Sensing

سال: 2022

ISSN: ['2315-4632', '2315-4675']

DOI: https://doi.org/10.3390/rs14143368